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Abstract 

This paper examines the applicabihty of Random Matrix Theory to portfoho man- 
agement in finance. Starting from a group of normally distributed stochastic pro- 
cesses with given correlations we devise an algorithm for removing noise from the 
estimator of correlations constructed from measured time series. We then apply this 
algorithm to historical time series for the Standard and Poor's 500 index. We dis- 
cuss to what extent the noise can be removed and whether the resulting underlying 
correlations are sufficiently accurate for portfolio management purposes. 

Key words: Random-matrix theory; cross-correlations; Econophysics; Numerical 
optimization 

PACS numbers: 02.50.SK,02.60.Pn,05.40.Ca,05.90.-hm 



1 Introduction 



The application of Random Matrix Theory to the analy s is of correlatio ns of fi- 



nancial time-seri e s has be e n widely studied b y Sengupta (Il997 ]^ Lalou: 
BouchaudI (Eoniii) : IPleroiJ (ll999L \2Qod. 120011): ISurdal (|2nn3t ): iMantegn, 



I Jtsugil (|2003t ) : iMalever g'ne 
matrices and bv iGalluccio 





: IPaflka feOO?" 2003") in the case of Gaussian 
Jurkiewica ((200^); ISornettel (|2000>) in the 



non-Gaussian case. The non-Gaussian case is more realistic but also more dif- 
ficult. One of the reasons of these difficulties is that it is not straightforward 
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to use variances and covariances as a measure of risk in the non-Gaussian 
case as it is in the Gaussian case. ISornett3 ()200Cll ) approaches this problem by 



mapping asymptotically power-law distributed assets in a non-linear way onto 
Gaussian variables whose covariances provide a new measure of dependence 
between non-normal returns. 

In this paper, however, we are go ing to forget that p r ice returns have "fat 



in tnis paper, nowever, we are go ing to lorget tnat p r ice returns nave lat 
tails, as it was first suggested by Mandelbrot ( 19631 ): EugFamal ( 1963[ ) and 



then proved by Gopikrishnan ( 19981 ) in a thorough statistical analysis, and 



test the case of Gaussian Random matrices. Here it is the correlation matrix 
that measures the mutual dependence of normally distributed financial time 
series and this is the quantity we will investigate. 

In this ensemble of random matrices there are several universal quantities, 
i.e. quantities which are the same for both a particular random matrix and 
the average over the whole ensemble. In principle then we can measure these 
quantities for a financial correlation matrix, compare them with those from 
the ensemble, which will be defined in the next section, and then extract 
information about the underlying correlations between stocks. 

One universal quantity is the eigenvalue distribution. The density of eigen- 
values for financial correlation matrices seems to be divided into two or more 
clearly separated parts (see Fig. 5). The first part, comprising the majority 
of the spectrum, corresponds to small eigenvalues. The other parts contain 
very few eigenvalues that are generally much larger than those from the first 
group. The components of these other groups are clearly separated from the 
first group and from one another. Since eigenvalue spectra in the ensemble 
have a similar structure the method consists in modifying the spectrum, re- 
placing each group of eigenvalues by groups of degenerate eigenvalues and 
reconstructing the correlation matrix from that modified spectrum and the 
eigenvectors of the original financial correlation matrix. 

This method is valid only under certain assumptions. 

(1) The financial time series should comprise Gaussian stochastic processes 
and 

(2) the underlying matrix of stock correlations should have only a small num- 
ber of distinct eigenvalues Aj for i = 1, . . . ,V that are clearly separated 
from one another Ai ^ A2 . . . ^ Ap. 

In this paper we examine the method for Gaussian distributed pseudo-random 
variables conforming to given correlations. We test the extent to which one 
can reproduce the underlying correlations from Gaussian time series of a given 
finite length. Furthermore, we apply the algorithm to historical time series of 
the Standard and Poor's 500 inde x and compare t he results to a different 
"denoising" algorithm suggested bv iBouchaud (|2000l) . 
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The paper is organized as follows. In section 2 we briefly discuss the theoret- 
ical approach, in section 3 we describe the procedure of fitting the theory to 
measured data. 



In the sections 4 and 5 we discuss the simulation and the results for financial 
data. In section 6 we discuss whether the amount of noise removed from the 
measured correlation matrix of Gaussian processes is sufficient for the cleaned 
matrix to be used for portfolio management purposes. 

We conclude with a brief discussion. 



2 Theoretical formalism 



Consider a ensemble of normally distributed stochastic processes X = {Xi^t} 
for i = 1, . . . ,N and t = 1, . . . , T. whose joint probability density reads: 



T / N \ 

p (X) = n hP , • • • , x^,t) n dXn,t 

t=l \ n=l ) 



1 



T N N 



N*T 



-^^M-l^T.llllx^,tC■jx,,) n dx^, 

^ t=l i=\ j=l n,t=l 



-J\f exi){—-Tr 



x^c-^x 



)DX 



with a normalization factor JV that ensures 
/P(X) DX = 1. 

The estimator of correlations in these processes is defined in the usual way: 



A — — XX^ 



t=l 



(2) 



It follows from this definition that the processes Xi^t and Xj^t' with different 
indices i ^ j are correlated only if the times are the same t = t', i.e. 



{Xi^tXj^t') — Cij6t^t' 



(3) 



Now we can calculate analytically certain a verages oyer the ensemble of stochas- 
tic proc esses Xjt called Wishart rnatrices Wishart ( 1928[ l. In particular it is 
possible iFeinberd (|l997l l: ISenguptal ()l997t ). to obtain a closed form expression 
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for the trace of a statistically averaged matrix that involves an estimator Qj- 
of the cross correlations. This resolvent is defined as follows: 



Giz) 



Tr\z — c 



-1 



Tr 



1 

z XX^ 

T 



-1 



P (X) DX 



(4) 



where the integral on the right-hand-side stands for N xT integrals over each 
of the variables Xa in the limits from — oo to oo. The result reads: 



G{z) 



T 



z — 



where = Tr 



C 



1 - CGiz] 



(5) 



In the case when the underlying correlation matrix C has only few distinct 
eigenvalues Oj with multiplicities "Pj for j = 1, . . . , "P the resolvent satisfies an 
algebraic (P + l)-order algebraic equation of the form: 



T 



V 



Giz) 



G{z) - K{r 



where Ki < K2 < ■ ■ ■ < i^v 



(6) 



where = 1 /cr| The following two constraints are imposed on the parameters 
Pi- Y.^=iPi = N and Y^=iPi/ f^e, = N. These follow from the fact that the 
correlation matrix C has a dimension and a trace N . The resolvent is a 
useful tool for calculating the density of eigenvalues 



Pa(A) = (E^(^-^^) 



(7) 



of the estimator of the correlation matrix Cjj in the limit 



00. The 



density of eigenvalues pa(A) is related to the resolvent as follows: 



Pa(A) 



27r 



lim,_o/"^ [^(A + ie) - G(A - ie)] 



and is universal, which means that it doesn't matter whether we perform the 
average over the ensemble of Wishart matrices in (7) or whether we drop the 
brackets () and compute the sum over eigenvalues of a particular representative 
of the ensemble. Numerically, the density of eigenvalues Pa (A) is computed by 
solving the algebraic equation (6) for G{z), taking the imaginary part of the 
solution and dividing it by vr. It is not difficult to realize that the imaginary 
part of the resolvent Pa(A) is different from zero only if A belongs to certain 
intervals A(«) - 5^^^ < A < A«) + 5^^^ such that G(A(«)) ~ k^T (see Fig. 13). 

For more details o f this a pproach w'e refer the interested reader to original 
works by iFeinbergI ()l997l ) ; ISenguptal ()l997l ) . We now discuss how to fit the 
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density of eigenvalues pa(A) in the ensemble of random matrices to pf{X) 
corresponding to cross-correlations of financial time series. 



3 The fit algorithm 



As mentioned in the introduction, eigenvalue spectra of cross-correlation ma- 
trices of financial time series consist of clearly separated groups of eigenvalues 
to be referred to as bands. Each band corresponds to one term in the sum on 
the right-hand-side of equation (6). The parameters and of the resolvent 
are determined, roughly, by the number of eigenvalues in the ^th band and by 
the location of the midpoint of the band. We calibrate the parameters of the 
resolvent against market data in the following manner. After computing the 
eigenvalues of the estimator of correlations (2) we calculate the histogram of 
eigenvalues and the smoothed density of eigenvalues P\{X) which is defined as 
a sum of narrow Gaussians pin-pointed at all eigenvalues, i.e 

T N 1 2 

PfW = T7E'^^(A-A.) where S,{x) = ^==exp{-} (9) 



The smoothed density pf (A) is used to find the preliminary location of bands, 
i.e. their midpoints I/k^, and the numbers of eigenvalues belonging to bands 
p^. Having determined the parameters we compute the resolvent G{X) from 
equation (6) by finding complex roots of a polynomial equation (computations 
are carried out with multiple precision due to high susceptibility of polynomial- 
roots to errors in coefficients) and then taking the imaginary part to get the 
Pa(A) in the ensemble of Wishart matrices. Finally we compute the optimal 
parameters k^, subject to the constraints, by minimizing the deviation between 
PaW ^^'^ Pa (A) using a down-hill simple x method - a multi-d i mensi onal min- 
imization routine from Nelder and Mead Numerical Recipes ( 1988| ). The pa- 
rameters p^ are not altered in the procedure. The minimization is carried out 
in P — 1-dimensional space. Specifically we minimize the sum of relative differ- 



ences of moments: 5 = Y/j 



=1 



E 



I (A^) where (A^) = pA{X)dX 



■T- 

and the "empirical" mean (A-^)^ refers to the density of eigenvalues of the 
measured estimator of correlations. 



From the optimal parameters p^ and we reconstruct the spectrum by re- 
placing the eigenvalues of the estimator Cj^ by 1/ ki with degeneracy pi, 1/^2 
with degeneracy p2 and so on until l/n-p with degeneracy p-p. Having done 
that we construct the "cleaned" correlation matrix cf J"" by multiplying a di- 
agonal matrix D with the new eigenvalues on the diagonal by two matrices on 
the right and left side respectively. On the right-hand side we have a matrix 
V = {vi, . . . ,vn) whose rows are eigenvectors of Qj, on the left-hand side we 
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have the transposition V"^. It means that cfj°"' 



J2p ^pVp,iVp,j- 



Finally we 



check the "degree of cleaning" , which we define as the deviation between the 
estimator and the cleaned estimator ^ — 



4 Testing the algorithm by simulation 



In order to check the reliability of the output of the algorithm applied to real 
data we perform Monte-Carlo simulations. At each time step t we generate 
N correlated Gaussian processes Xi{t) with given cross-correlations Cij. This 
is done by generating at each time step N uncorrelated Gaussian pro cesses 
Yi(t), by means of the Box- Mueller algorithm [Numerical Recipes I ()l988[ l. and 
building from them linear combinations Xi{t) = J2p=i ^i,p\/\>Yp{t) with co- 
efficients that depend on eigenvalues Xp and eigenvectors V^.p of the underlying 
correlation matrix Cij. 

Now we compute the estimator of the correlation matrix (2), diagonalize it 
and compute the smoothed density of eigenvalues pf{X) from formula (9) with 
e of the order of the mean level spacing between consecutive eigenvalues. The 
results are shown in Fig. 1. Having obtained the preliminary estimates of pa- 
rameters and Kg we carry out the rest of the computation, as described 
at the end of the previous section, and obtain the "cleaned" correlation ma- 
trix c*^^^"". We quantify the "goodness of cleaning" by comparing the mean 



deviations 



^clean ^ 



and |c — C|. 



The dependence of the "goodness of cleaning" on the length of time series T 
is shown in Figs. 2,3. We can see two desirable features in the figures. Firstly 
the deviation |c — C| of the estimator from the underlying matrix decreases 
with increasing length of time series. This has nothing to do with the clean- 
ing procedure and is only a manifestation of the fact that the estimator Cij 
approaches the estimated matrix Cj.-,- monotonously as N/T — > 0. Secondly 
the "goodness of cleaning" improves with decreasing N/T (see inset in Figs. 
2,3). Indeed it drops from 0.85(0.74) to 0.74(0.7) in Figs 2 and 3 respectively. 
We can, therefore, say that for Gaussian processes the noise filtering algo- 
rithm removes 15%-26% and 26%-30% noise N/T = 0.1 - -0.01 for the both 
correlation matrices tested in in items (1) and (2), sec. 4 respectively. 

Finally a remark with respect to the choice of the underlying correlation matrix 
Cij. Since the magnitudes of the eigenvalues of Cij for financial correlation 
matrices differ considerably we need to take the underlying correlation matrix 
with the same property. We require Cij to have three distinct eigenvalues 
Ai -C A2 -C A3, with degeneracies — 7, 4 and 3 respectively, such that 
A2/A1 = 50 and A3/A1 = 100. Since the trace of the correlation matrix is A^ 
there are certain constraints imposed on the A's. Now, we take two choices of 
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Eigenvalue-Histogram 
Mildly Smoothed Eigenvalue-Density - 
Strongly Smoothed Eigenvalue-Density - 




Fig. 1. The histogram of eigenvalues and the smoothed densities of eigenvalues for 
two smoothing parameters for a block- matrix Cij defined in in item (2), sec. 4. . 
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Cjj, namely: 



1^1) Cij = f{i—j) with f{i) being a Fourier transform of a piece-wise constant 
function with three distinct values. 
Here C = 100^-^' + E^li 2/(A;) cos {27rk{i - j) /N} where 



~m 



1 if 1 < A; < 46 
50 if 47<A;<48 
100 if 49 < A; < 50 



(10) 



and /(lOO — A:) = f{k). The matrix C which has three distinct eigenvalues 
(Ai = 0.1686340641, 50Ai, lOOAi) with degeneracies (93, 4, 3) respectively. 
(2) A block-matrix that consists of S blocks of dimensions for = 
1, ... ,5 on the diagonal and zeroes elsewhere. The blocks itself have a 
structure 





^ ICC . 


.c\ 




CIC . 


.c 




CCl . 


.c 




^ccc . 





(11) 



where — 1 < C < 1. This choice is dictated by the fact there may ex- 
ist groups of stocks on the market, which have the property that two 
stocks arc only correlated if they belong to the same group. This means 
that (X^^iXj^j) = S^^rfii.j for i,j = 1, . . . and ^,77 = 1, . . . , S. In the 
following we choose Cij = C for simplicity and also because it renders 
the smallest eigenvalue of all block matrices to be the same. The min- 
imal dimension of such matrix Cij that has three distinct eigenvalues 
-^1) -^2; A3 with degeneracies — 7, 4 and 3 and for which A2/A1 = 50 and 
A3/A1 = 100 is = 493. The matrix contains four blocks of dimension 
rii = 49 and three blocks of of dimension 77,5 = 99 and the parameter C 
in each block is C = 1/2. 



5 Results for financial data 



Having tested the algorithm on Gaussian stochastic processes we now apply 
it to daily financial data from the Standard and Poor's 500 stock group. Here 
we check if the eigenvalues A and their degeneracies V, which we obtain as a 
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Fig. 2. Results of a test of the cleaning algorithm for N = 100 correlated Gaussian 
stochastic processes for the underlying correlation matrix in item (1), sec. 4. Plot- 
ted are deviations 5 = Yli j \ ~ between the underlying correlation matrix 
C and the estimator of it c(c^) before and after applying the "cleaning procedure" 
respectively. The inset includes a "degree of cleaning" , i.e. ratio of the deviations af- 
ter and before applying the denoising procedure. The "degree of cleaning" improves 
with the length of time series. 




1000 




10000 
Length of^me series T 



result of the mii iimization pro cedu re described abov e, correspond to eigenval- 
ues obtained by Laloux ( 19991 ) and iBouchaudI ^200(1 . What one did there was 



to order the eigenvalues in ascending order, divide them into groups (there 
were usually only two groups, the "noisy part" containing most eigenvalues 
and one separated eigenvalue much larger than the rest) and replace the eigen- 
values from the group by one degenerated eigenvalues equal to the mean over 
eigenvalues from the group (in order to conserve the trace of the correlation 
matrix) . In other words the group of eigenvalues A , Ajj , • • • , Aj^ was replaced 
by one eigenvalue (A) = /p with a degeneracy p. 

We have performed computations for groups of stocks with a fixed length 
T = 3000 days and with a growing number of stocks N. The optimal eigenval- 
ues and their degeneracies are shown in Table 1. As we can see the al gorithm 



i s stab le, i.e. it produces results which are close to that (A) obtained bv lLaloux 



f|l999l ) "or different values of A^. In other words we have shown that the min- 



imization of deviations between the spectrum of eigenvalues of the measured 
estimator of cross-correlations and the spectrum of eigenvalu es of a c ertain 
ran dom matrix amounts, approximately, to the procedure by iLaloux (|l999l) 



and iBouchai^ (|2000>) . i.e. to replacing each group of eigenvalues by one sin- 



gle, degenerated eigenvalue whose value is equal to the mean over the group. 
However, as we can see in Table 1, the eigenvalues A and (A) do not match 
exactly. This is due to the fact that sometimes it is difficult to define a band 
(see Figs. 5-12) in the spectrum of the measured estimator. This depends on 
the smoothing parameter e in (9). Moreover, it is not quite clear that the 
minimization routine will lead us to the global minimum. If there are more 
than V = 2 bands then it is quite likely for the algorithm to get stuck in a 
local minimum, which will be hard to distinguish from the global minimum. 
Indeed, in the last row in Table 1 the underlying eigenvalues A's and the mean 
eigenvalues (A) do not match but even though the deviation 6 is of the same 
order than in previous rows. 

From Table 1 we see that the minimization was convergent in a reasonable 
time (the number Af of function evaluations was reasonable) even if there were 
several parameters to be optimized. 



6 Optimal portfolio 



According to Markovita ( 19591 ) the optimal portfolio P = {pi, . . . ,pn} of in- 
dependent, correlated Gaussian stochastic processes, that ensures the smallest 
possible risk Dp = J2i,jPiPjCij subject to the constraint J2iPi = 1 reads: 



Pi 



(12) 
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N ^^S"" °* X{V) (A) (P) Ic-c'^'^""! S V-\ M 



100 


AA- 


-CMS 


0.8548(99) 
15.3729(1) 


0.8544(99) 
15.4158(1) 


0.02895 


0.0902 


1 


49 


150 


AA- 


-EP 


0.8062(148) 

7.6541(1) 

23.0255(1) 


0.8216(148) 
5.3063(1) 

23.0968(1) 


0.02732 


0.0236 


2 


134 


200 


AA- 


HLT 


0.8088(198) 

9.8260(1) 

30.0408(1) 


0.8249(198) 

6.5286(1) 

30.1357(1) 


0.02666 


0.0228 


2 


100 


250 


AA- 


-LTD 


0.8068(248) 

12.1353(1) 

37.7897(1) 


0.8235(248) 

7.8638(1) 

37.9071(1) 


0.02661 


0.0248 


2 


154 


300 


AA- 


-NYT 


0.7862(296) 
7.0621(2) 
7.6146(1) 
45.5366(1) 


0.7899(296) 
5.5244(2) 
9.4625(1) 
45.6709(1) 


0.02049 


0.0295 


3 


124 



0.7713(345) 0.7750(345) 
2.3814(1) 4.6842(1) 



350 


AA-SCH 


7.2660(2) 

14.1737(1) 
52.8160(1) 


6.7622(2) 

11.4487(1) 

52.9780(1) 


0.01904 


0.0112 


4 


293 


400 


AA-UCL 


0.7692(395) 

2.4292(1) 

10.6966(2) 

12.7553(1) 

59.5976(1) 


0.7776(395) 

5.1295(1) 

7.5616(2) 

12.8304(1) 

59.7666(1) 


0.01969 


0.0099 


4 


259 






0.7718(430) 
15.8633(1) 


0.7793(430) 
5.6098(1) 










435 


AA- 
ZION 


5.7546(2) 

10.9826(1) 

64.7905(1) 


7.9069(2) 

13.4956(1) 

64.9812(1) 


0.02611 


0.0101387 


4 


409 



Table 1 



Eigenvalues and degeneracies A and T> of the underlying correlation matrix mean 
eigenvalues (A) in the consecutive bands, the sum of relative differences of moments 
5 of the empirical correlation matrix and the corresponding "stochastic" matrix, 
i.e. that of the Wishart matrix ensemble, the deviation |c — c*^'^""! of the "cleaned" 
estimator from the measured estimator, the number of parameters V — 1 optimized 
and the number of function evaluations N in the downhill-simplex method needed 
to achieve a tolerance of 10~^. Data correspond to daily historical times series 
(r = 3000 days) of different groups of stocks of the Standard and Poor's 500 group 
of shares. 



Practical applications of this recipe are limited because the weights Pi are very 
sensitive to perturbations of the matrix elements Cjj. We have tested the sen- 
sitivity in the following simple experiment. Having perturbed the correlation 
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matrix Cj^ with Gaussian noise with mean zero and variance o;^, i.e 

Cj. = Q,, + an,, (13) 



where T^jj = Normal{0, 1) we measure the average relative variation 
2 \pf - Pi 



in the Markovitz' weights defined in (12). Here pi and pf are related to the 
matrices Cij and Cj^j respectively. The results are shown in Fig. 4. As we can 
see an addition of only 1% noise {a = 0.01) makes the Markovitz' weights 
change already by 25% (A = 0.25). On the other hand in Fig. 3 we see that 
the estimator of correlations differs from the underlying correlation matrix 
Cij by approximately 0.8%-1.2% for N/T = 0.05-0.1. In sec. 4 we concluded 
that the algorithm removes on average 25% noise once the ration N/T < 0.1. 
Therefore the noise contamination of the "cleaned" correlation matrix reduces 
to 0.6%-0.9% and this gives us, from Fig. 4, a decrease of the variation A in 
Markovitz weights 0.13-0.17 and 0.22-0.3 in case of the smallest and largest 
values of N/T respectively. We see that the relative percentage variation in the 
Markovitz-weights after applying the "cleaning" procedure is about (0.17 — 
0.13)/0.17~ 25%. 
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Variation of IVIarkowitz-weiglits under addition of Gaussian noise 
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a 

Fig. 4. The average relative variation in the weights of stocks (Markovitz-weights) 
under addition of independent Gaussian noise with mean zero and variance to 
the correlation matrix defined in item (2), sec. 4. 



7 Conclusions 



We have pro ved that th e quali tative cross-c o rrelat ions' "cleaning" procedure 
suggested by iBouchaudI tOOij l and iLalouxl corresponds to a numer- 

ical minimization problem, namely to the problem of minimizing deviations 
between the eigenvalue spectrum of the measured cross-correlations and the 
spectrum of a random matrix. Moreover, we have shown qualitatively that if 
the ratio N/T < 0.1 the algorithm filters out roughly 25% noise from Gaussian 
distributed stochastic processes, which corresponds to a percentage correction 
of the Markovitz' weights by roughly 25%. Quantification of this statement 
will be desirable. 



Future work will be devoted to developing an analysis of cross-correlations be- 
tween stocks in a non-Gaussian market where distributions of returns exhibit 
power-law tails. 
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Histogram 
Smoothed Density 



lm[G{z)] 
Underlying Eigenvalues 



Fig. 5. The histogram of the density of eigenvalues of the correlation matrix of 
the first hundred stocks AA-CMS from the Standard and Poor's 500 group, the 
smoothed density /Oe(A), the imaginary part paW of the Greens function and the 
eigenvalues of the "cleaned" correlation matrix Cij. The empirical moments are 
= (1.000, 1.000, 3.314, 38.064, 567.738, 8713.976, 134235.952, 2069070.046, ) , 
the moments corresponding to the resolvent are 

M = (1.000, 1.000, 3.120, 37.267, 564.756, 8701.178, 134235.871, 2071710.416) . 




1.5 



Histogram 

Smoothed Density □ 



22.7 22.9 23.1 23.3 
lm[G{z)] «- 
Underlying Eigenvalues ' 



2 



T &rQ^ 



Ti 1 1 n 1 1 1 — r 




Fig. 7. The same as in figure Figs. 5,6 but for the first 200 S&P500 stocks. 




Fig. 8. The same as in figure Figs. 5,6, 7 but for the first 250 S&PSOO stocks. 
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Fig. 10. The same as in figure Figs. 5-9 but for the first 350 S&:P500 stocks. 
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64 64.5 65 65.5 66 

Histogram lm[G{z)] ^ - 

Smoothed Density □ Underlying Eigenvalues 

Fig. 12. The same as in figure Figs. 5-11 but for the first 435 S&:P500 stocks. 
In contrast to the previous cases here the underlying eigenvalues do not match the 
midpoints of the bands but even though the deviation 6 = \pa{^)—P\W\ is minimal 
(compare Table 1). i y 




Fig. 13. Illustration of how the imaginary part of the resolvent pa(A) defined in 
(8) and (6) splits into several bands when > 1. In each row the Figure on the 
left-hand side shows a plot of function z = f{G) zoomed out in the vicinity of a 
singularity. On the right-hand side we plot the imaginary part of the resolvent pa(A) 
as a function of A (marked with diamonds) overlaid with the empirical density of 
eigenvalues (asterisk) . Vertical beams ended with a squares denote eigenvalues 1/k^ 
of the cleaned correlation matrix Cij and their degeneracies (compare equation 
(6)). The singularities of the function z = f{G) have been chosen in order to achieve 
a best conformance of /9a(A) to market data of the AA-ZION stocks from the 
Standard and Poor's group. 
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